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Abstract 

Background: To investigate the effect of physician- and patient-specific feedback with benchmarking on the 
quality of care in adults with type 2 diabetes mellitus (T2DM). 

Methods: Study centres in six European countries were randomised to either a benchmarking or control group. 
Physicians in both groups received feedback on modifiable outcome indicators (glycated haemoglobin [HbA lc ], 
glycaemia, total cholesterol, high density lipoprotein-cholesterol, low density lipoprotein [LDL]-cholesterol and 
triglycerides) for each patient at 0, 4, 8 and 12 months, based on the four times yearly control visits recommended 
by international guidelines. The benchmarking group also received comparative results on three critical quality 
indicators of vascular risk (HbA lc , LDL-cholesterol and systolic blood pressure [SBP]), checked against the results of 
their colleagues from the same country, and versus pre-set targets. After 12 months of follow up, the percentage 
of patients achieving the pre-determined targets for the three critical quality indicators will be assessed in the two 
groups. 

Results: Recruitment was completed in December 2008 with 3994 evaluable patients. 

Conclusions: This paper discusses the study rationale and design of OPTIMISE, a randomised controlled study, that 
will help assess whether benchmarking is a useful clinical tool for improving outcomes in T2DM in primary care. 

Trial registration: NCT00681850 



Background 

Physicians in Europe and throughout the world are 
diagnosing and treating a growing number of patients 
with type 2 diabetes mellitus (T2DM). The International 
Diabetes Federation estimates that approximately 55 
million people in Europe had diabetes in 2010 [1]. Over 
90% of these are patients with T2DM, and management 
of its associated complications already requires a signifi- 
cant level of support in terms of financial costs and 
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healthcare utilisation [2]. In 2009 the healthcare expen- 
diture for diabetes was estimated to account for 10% of 
the total health budget in Europe [1] and this financial 
burden, together with the accompanying social burden 
associated with reduced quality of life and productivity 
caused by the condition, its numerous comorbidities 
and premature death, is undoubtedly set to rise over the 
next decade. 

Clinical research has shown that long-term complica- 
tions of diabetes can be delayed or even prevented when 
management by the healthcare team and the patient's 
self-management are optimised [3-5]. However, the 
apparent inability to effectively translate this scientific 
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evidence into real-life clinical practice represents a 
major barrier to reducing the burden of diabetes com- 
plications [6,7]. 

Western healthcare systems are currently in a transi- 
tion phase; shifting towards organisational structures 
that can deliver improved diabetes care specifically, and 
chronic care in general [8,9]. T2DM is often used as 
'the' test case for new models of care. The assumption is 
that if an intervention works in as complex a disease as 
T2DM, there is a fair likelihood that it will be effective 
in other chronic conditions. Because the majority of 
chronic illness care is performed within the primary 
care setting, redesign of primary care has been proposed 
in an attempt to close the gap between current practices 
and optimal standards [10]. How such a redesign might 
be accomplished is the subject of ongoing research and 
socio-economic debate, and in this process the Chronic 
Care Model is often used as a conceptual framework 
[11-13]. This model identifies six essential elements that 
impact upon chronic care: (i) community resources and 
policies, (ii) healthcare organisation, (iii) self-manage- 
ment support, (iv) delivery system design, (v) decision 
support, and (vi) clinical information systems [12]. 
Initial evidence demonstrates that the Chronic Care 
Model can improve chronic care and, in some cases, 
reduce healthcare costs [8,9,13]. However, more 
research is needed to substantiate the relative effective- 
ness of the components of this model. 

One element of the Chronic Care Model that has not 
been studied extensively, but which nevertheless may 
offer a realistic opportunity to improve chronic care, is 
the use of clinical information systems. These include 
reminder systems that help primary care teams comply 
with practice guidelines, and feedback to physicians 
showing how they are performing with regard to quality 
of care indicators. Practice recommendations (which 
may be based on national or international guidelines) 
are generally used as a reference for comparison when 
providing feedback. 

Benchmarking, a technique used for quality manage- 
ment in industry since the 1980s, has only recently been 
introduced into the clinical setting. It consists of mea- 
suring the clinical performance of the healthcare deliv- 
ered by an individual clinician or a clinic over a 
specified period (termed the audit), comparing it with 
the performance of a peer group or a group of similar 
clinics (benchmarking), and transmitting this compari- 
son back to the clinician or clinic (benchmarked feed- 
back) (Figure 1). Benchmarking therefore encompasses 
an intellectual, emotional and/or competitive stimulus 
for change. It also offers an opportunity to break 
through the isolation that many primary care physicians 
encounter, as they are not routinely able to compare 
their performance with that of their peers. 



Example: BP <1 30/80 mmHg for patients with diabetes 



Centre X - your results: 40% 




i i i r 



IQED = Initiative for the promotion of quality and epidemiology of diabetes mellitus 

Figure 1 Example of benchmarking and feedback process. 



It would appear to be intuitively logical that if health- 
care professionals were given feedback that their clinical 
practice, and therefore their patient outcomes, was 
inconsistent with that of their peers or accepted guide- 
lines, they would modify their attitude and practice. 
However, there are few published reports of randomised 
studies investigating this approach in T2DM in primary 
care, and the effectiveness of feedback and benchmark- 
ing has not been definitively proven in a randomised, 
controlled setting. OPTIMISE (OPtimal Type 2 diabetes 
Management Including benchmarking and Standard 
trEatment) was designed as a randomised controlled 
trial assessing the effect over a 12-month period of phy- 
sician- and patient-specific feedback with benchmarking 
on the quality of primary care in T2DM patients from 
six European countries. The aim of the current paper is 
to describe the rationale for, and methodology used in 
the OPTIMISE study, including details of the bench- 
marking and feedback process. 

Methods 

The original protocol developed in Belgium was subse- 
quently adapted for extension to five other European 
countries: Greece, Luxembourg, Portugal, Spain and the 
UK. Patient recruitment began in March 2008 in Bel- 
gium, and in September 2008 for the other countries. 

Patients 

All patients with T2DM (insulin treated or not), aged 
>18 years were eligible for the OPTIMISE study. Exclu- 
sion criteria were patients with type 1 diabetes, gesta- 
tional diabetes, patients who were hospitalised for 
diabetes regulation or who were members of the Belgian 
Diabetes Convention (i.e. hospital-based diabetes centres 
already participating in a quality assurance programme 
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with benchmarked feedback). The definition of diabetes 
used was a fasting plasma glucose test of > 126 mg/dL. 

Study design 

This randomised controlled trial was performed in 
accordance with the Declaration of Helsinki and ICH/ 
Good Clinical Practice and applicable regulatory 
requirements [14]. The study was sponsored by Astra- 
Zeneca. Written, informed consent was obtained for all 
patients before initiation into the study. The OPTIMISE 
study is registered at ClinicalTrials.gov (NCT00681850). 

The aim of the study was to assess the impact of 
benchmarking on the quality of T2DM care, by deter- 
mining, as the primary objective, the percentage of 
patients in the benchmarking group achieving, after 12 
months of follow up, pre-set targets of three critical 
quality indicators (glycated haemoglobin [HbA lc ], low- 
density lipoprotein [LDL] -cholesterol and systolic BP 
[SBP]) versus the control group. The null hypothesis 
was that the percentage of patients at target after 12 
months of follow up in the benchmarking and the con- 
trol groups would be equal. Secondary objectives 
included determining the percentage of patients achiev- 
ing the pre-set targets of the critical quality indicators 
and glycaemia versus baseline; determining the percen- 
tage improvement of the critical quality indicators and 
glycaemia versus baseline; and to follow up evolution 
markers of preventative screening (retinopathy, neuropa- 
thy, dietary counselling, microalbuminuria, smoking 
habits, body mass index [BMI], and physical activity). 

Centres or sites were selected on the basis of having 
sufficient patients with diabetes in their practice and 
with investigators that were motivated to fulfil the 
administrative procedures linked to the study. Centres 
or sites were randomised by a central randomisation 
procedure to either a benchmarking group or a control 
group. To ensure that country-specific treatment prac- 
tices were reflected in the study population, the centres 
or sites involved in the management of diabetes that 
were recruited, consisted of general practitioner or hos- 
pital-based outpatient clinics. Recruitment for the study 
began initially in Belgium with a randomisation ratio of 
1:1. Each centre was expected to recruit 8-12 patients; 
however, some centres enrolled 15-20 patients. The 
adapted sample size analysis showed that a power of 
79% was achieved for the parameter blood pressure and 
a significance level of 0.05 was achieved for the para- 
meters HbA lc and total cholesterol. As sufficiently high 
numbers of physicians were recruited in Belgium, this 
subsequently allowed a 3:1 randomisation ratio of 
benchmarked to control practices in the other countries, 
which was deemed more attractive to the participating 
physicians. All physicians involved in the study contin- 
ued with the routine monitoring, treatment and 



counselling of their patients with T2DM. Over the 
course of the study, physicians in both groups received 
the results of modifiable outcome indicators (HbA lc , 
fasting glycaemia, total cholesterol, high-density lipopro- 
tein [HDL] -cholesterol, LDL-cholesterol and triglycer- 
ides [TG]; determined in a central laboratory). 
Physicians in the benchmarking group additionally 
received information on the level of control of the criti- 
cal quality indicators of their individual patients, anon- 
ymously compared with the results of their colleagues 
from the same country, and compared with target levels 
(Figure 2). This benchmarked feedback was provided 
every 4 months, corresponding to three- to four-times 
yearly control visits, as recommended by international 
guidelines [15]. The study recruitment and initiation 
visit was defined as Visit 1 out of the four visits. 

Patient evaluation and quality indicators 

Each patient was followed for 12 months. Overnight 
fasting levels of HbA lc , glycaemia, total cholesterol, 
HDL- and LDL-cholesterol and TG were determined at 
each visit. Blood samples were sent to a central labora- 
tory (Bio Analytical Research Corporation [BARC], 
Ghent, Belgium) for analysis. HbA lc was measured 
using ion-exchange high performance liquid chromato- 
graphy (HPLC) (Adams A lc HA8160, Arkray, Kyoto, 
Japan). LDL-cholesterol data from evaluable patients 
was estimated using Friedewald's formula, which uses 
total cholesterol, HDL-cholesterol and fasting TG when 
the latter are < 400 mg/dL. Urine was analysed for 
microalbuminuria at Visits 1 and 4 using an immuno- 
turbidimetric assay (Modular P analyser, Roche Diagnos- 
tics, Mannheim, Germany). 

Casual (resting) BP, waist circumference, weight and 
height were measured by the physician at each visit. BP 
was measured using a manometer with an appropriately 
sized cuff with the patient in the sitting position for at 
least 5 minutes prior to the reading. A mean value was 
determined from three successive readings and 
recorded. Waist circumference was measured using a 
measuring tape placed around the patient's abdomen at 
the level of the iliac crest and at the end of normal 
expiration. Patients wore light clothing and no shoes 
during weight measurement - values were rounded up 
to the nearest kg from 0.5 kg. Patients wore no shoes 
during height measurement - values were rounded up to 
the nearest cm from 0.5 cm. 

A history of proteinuria at baseline was established 
from proteinuria levels in the patient's medical records 
and computed in the electronic Case Report Form 
(eCRF). Smoking status was also recorded in the eCRF 
as smoker/ex-smoker/non-smoker, together with the 
number of cigarettes, cigars or pipes that the patient 
was smoking, or had smoked per day. In addition, 
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Group 1: Control group 
Visit 0 



Visit 1 




Group 2: Benchmarking group 
Visit 0 



Visit 1 




Figure 2 Study flow chart. CRF, case report form; IC, informed consent. 



records were kept of whether or not a needs assessment 
for low-dose aspirin therapy (as an antiplatelet agent) 
had been performed during the 12 months of the study, 
as at the time the protocol was written, this was a 



recommended parameter to be taken into consideration 
for patients with T2DM [16]. Optional assessment by 
the treating physician at each visit of the degree of phy- 
sical activity could be carried out using a step counter. 
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A four-point verbal rating scale (VRS) was used to 
assess total physical activity with the following criteria; 
no weekly activity; only limited physical activity during 
most weeks; intense physical activity (activity that gives 
rise to shortness of breath, tachycardia and sweating) 
during at least 20 minutes, once to twice a week; intense 
physical activity (activity that gives rise to shortness of 
breath, tachycardia and sweating) during at least 20 
minutes, >3 times a week. 

Critical quality indicators used for benchmarking were 
HbA lc , LDL-cholesterol and SBP. Secondary quality 
indicators, not used for benchmarking, were blood glu- 
cose, TG, total and HDL-cholesterol, diastolic blood 
pressure (DBP), BMI, waist circumference and smoking 
status. Whether or not dietary advice had been given 
(and by whom), and whether screening for retinopathy 
and foot examination had been performed during the 12 
months of the investigation, were also recorded and 
analysed as secondary quality indicators but were not 
part of the benchmarking feedback. 

Target setting for benchmarking 

The targets set for the critical quality indicators (HbA lc , 
LDL-cholesterol and SBP) used for benchmarking were 
based on international guidelines [16-18], and, when 
deemed necessary, adapted by the study steering com- 
mittee in order to be consensual to the participating 
countries (Table 1). 

For physicians in the benchmarking group, bench- 
marked feedback was given to each practice for both the 
pooled patient population at that practice, and at an 
individual patient level (Figure 3). A bi-modal response 
was given (good, too high) for each of the critical quality 
indicators, signifying whether or not the patient/clinic 
had achieved targets. In Belgium, the National Steering 
Committee opted for a tri-modal response: HbA lc <7.0% 
= excellent, 7.1-7.5% = borderline, > 7.5% = too high; 
SBP < 130 mmHg = excellent, 130-139 mmHg = bor- 
derline, >140 mmHg = too high; LDL-cholesterol < 80 
mg/dL = excellent, 80-99 mg/dL = good, >100 mg/dL = 
too high (< 70 mg/dL = excellent in secondary 
prevention). 

Statistical considerations 

In this study, all patients treated by an investigator 
received the same type of follow up (either 



Table 1 OPTIMISE Study: Targets set for benchmarking 





SBP 


LDL-cholesterol 


HbA lc 




(mmHg) 


(mg/dL) 


(%) 


All countries 


< 130* 


< 100 + 


< 7.0 


Belgium 




< 80 + 





*Patients with proteinuria < 125 mmHg 

+ Patients with existing coronary heart disease < 70 mg/dL 



benchmarking or standard). Since each investigator may 
have a particular approach for monitoring, counselling 
and treating their patients with T2DM, it was expected 
that this might result in an investigator or cluster effect. 
The extent of this effect is reflected by the Intracluster 
Correlation Coefficient (ICC) which, according to the 
literature, varies for BP between 0.05 and 0.10, indicat- 
ing a clustering influence of 5-10% [19-21]. Of the three 
critical quality indicators used for benchmarking, SBP 
was expected to be the least well controlled at baseline. 
Therefore, for the purposes of study sample size calcula- 
tions, it was considered most important to be able to 
demonstrate significance in the improvement in propor- 
tion of patients achieving the pre-set targets for SBP. 

For example in the Best study [22], blood pressure 
was controlled in only 12.3% of patients at baseline; 
however, the proportion of patients with their blood 
pressure under control increased by 68.3% after 6 
months of more intensive follow up. Due to the stric- 
ter target (130/80 mmHg in the OPTIMISE study 
compared with 140/90 mmHg) and the longer duration 
of this study, an improvement of 88.7% could be 
expected in the proportion of patients reaching blood 
pressure targets in both groups, from 12.3% to 23.2% 
in the control group and the benchmarking group 
under the null hypothesis (the null hypothesis is that 
the percentages in both groups are equal). There are 
relatively few previous studies reporting the level of 
improvement seen when benchmarking was used. 
However, review of those available [23-27] suggest that 
the level of improvement that might be expected 
ranges from 15-51%. Based on these findings it was 
estimated that the use of benchmarking in OPTIMISE 
could further improve the level of blood pressure con- 
trol by 32%, i.e from 23.2 to 30.6% (the alternative 
hypothesis is that the percentage in the benchmarking 
group is larger than in the control group). With 3000 
evaluable patients in the benchmarking group (300 
clusters of 10 patients) and 1000 patients in the con- 
trol group (100 clusters of 10 patients), a two-sided Z 
test (unpooled) would give 91%, 86% and 80% power 
to detect a 74% difference between the proportion of 
patients on target for SBP for an ICC of 0.05, 0.07 and 
0.10 (between 5 and 10%). 

As some centres included up to 20 patients, sample 
size calculations were adapted. With 3000 evaluable 
patients in the benchmarking group (200 clusters of 15 
patients) and 1000 patients in the control group (67 
clusters of 15 patients), there would be 96%, 93% and 
88% power to detect a 74% difference between the pro- 
portion of patients on target for SBP for an ICC of 0.05, 
0.07 and 0.10, respectively, p = 0.05. With 3000 evalu- 
able patients in the benchmarking group (150 clusters of 
20 patients) and 1000 patients in the control group (50 
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A. At practice level 



Mean SBP value vs National Mean SBP values 



Visit 1 



1 1 1 

Mean value 155 
A Ranking 83/99 
W Percentile 83.8 




100 110 120 130 140 150 



160 170 



B. At patient level 



Patient's SBP value vs National Mean SBP values 



Patient repartition (8 patients) 




None of your patients are under control 

Patient repartition (8 patients) 




50% of your patients are under control 
Patient repartition (6 patients) 
17% I 




33% of your patients are under control 
Patient repartition (4 patients) 



75% 



25% of your patients are under control 








Patient's 
value 


Unit 

Change unit 


National 
mean value 


Visit 1 


/% 


160 


mmHg 


150 


Visit 2 


/% 


142 


mmHg 


140 


Visit 3 


/, 


135 


mmHg 


134 


Visit 4 




128 


mmHg 


120 



Visit 1 Visit 2 Visit 3 Visit 4 



Visit 1 




Visit 2 




Visit 3 




Visit 4 


4 



Attention: Systolic blood pressure 
is too high for this patient 

Attention: Systolic blood pressure 
is too high for this patient 

Attention: Systolic blood pressure 
is borderline for this patient 

Congratulation: Systolic blood pressure 
is good for this patient 



Figure 3 Example of benchmarking in OPTIMISE. Reproduced with permission from Whathealth.eu. SBP, systolic blood pressure. 



clusters of 20 patients), there would be 93%, 88% and Descriptive statistics (mean, median, number of obser- 

81% power to detect a 74% difference between the pro- vations, standard deviation, standard error, 95% confi- 

portion of patients on target for SBP for an ICC of 0.05, dence intervals - minimum and maximum) of all 

0.07 and 0.10, respectively, p = 0.05. primary and other variables will be tabulated. Given the 
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expected clustering of patients for each physician, multi- 
level mixed modelling will be performed using SAS 
GLINMIX for categorical variables (target attainment 
after 12 months in the benchmarking group compared 
with the control group, and target attainment after 12 
months compared with baseline levels) and SAS Prox 
MIX for continuous variables (mean percentage 
improvement of HbA lc , LDL-cholesterol and SBP after 
12 months follow up). 

Results 

Recruitment commenced on 6 March 2008 and finished 
31 December 2008. The database lock date was 31 
March 2010. A total of 4026 patients, of which 3994 are 
evaluable, were enrolled from 370 sites across six coun- 
tries in Europe. In the All Patients group, 55.1% were 
male. The benchmarking group comprised 2484 
patients; the control group comprised 1510 patients 
(Table 2). Baseline demographics were similar for both 
groups. Full results will be reported elsewhere. 

Discussion 

It is important to note that there is some confusion in 
the literature between the terms feedback and bench- 
marking. In the OPTIMISE study, benchmarking refers 
to the process of comparing critical quality indicators 
with the results of peers. Communicating these results 
to the physicians will be referred to as benchmarked 
feedback or feedback with benchmarking, to distinguish 
it from communicating the levels of the quality indica- 
tors alone to the physicians. 

There is emerging evidence from non-randomised 
controlled trials (non-RCTs) that feedback and bench- 
marking exert a positive effect on process, outcome and 
overall provision of diabetes care [28-31]. However, fol- 
low up over time without specific interventions has also 
demonstrated improvements in diabetes care [32,33]. 
Thus, it is difficult in non-RCTs to ascertain whether 
any detected improvements in care are due to the speci- 
fic intervention applied. 

There is limited evidence from RCTs on the effects of 
feedback on quality of care in T2DM in particular, and 



Table 2 OPTIMISE Study: Number of patients 
participating in each country 



Country 


Benchmarking 


Control 


All patients 


Belgium 


1146 


1044 


2190 


Greece 


570 


227 


797 


Luxembourg 


136 


62 


198 


Portugal 


134 


51 


185 


Spain 


257 


55 


312 


UK 


241 


71 


312 


Total 


2484 


1510 


3994 



in chronic care in general. Furthermore, evidence from 
RCTs on the effect of benchmarking is virtually non- 
existent. A recent systematic literature review [34] iden- 
tified 10 RCTs evaluating feedback only, but not bench- 
marking, in T2DM in primary care. The studies were 
very heterogeneous in aims and designs and a meta-ana- 
lysis of these trials was not considered feasible. A variety 
of outcome measures were studied. Feedback in the 
T2DM studies seemed to improve process markers such 
as foot examinations, eye examinations and HbA lc mea- 
surements. Clinical effect markers such as BP, levels of 
HbA lc and cholesterol were only assessed in three stu- 
dies, showing small positive effects on BP in three, on 
HbA lc in two, and on cholesterol in one study. Since 
this systematic review, two additional RCTs of feedback 
in T2DM (one using benchmarking) were published 
[35,36]. However, neither of these studies showed any 
effect on either process or outcome measures. There- 
fore, while some studies suggest that feedback can lead 
to a small improvement in testing rates, overall it seems 
that it rarely changes downstream clinical outcomes 
such as HbA lc levels. The evidence from the literature 
therefore suggests that clinical inertia is not driven by 
physicians' knowledge deficit, but is due to reluctance 
among physicians to make management changes. There- 
fore, benchmarking, using comparisons with peer per- 
formance, could potentially add an emotional- 
competitive drive to overcome this barrier. 

There are limited publications on the effects of bench- 
marking on quality of care in T2DM. A systematic 
review of controlled trials evaluating the effectiveness of 
various interventions to improve T2DM in primary care 
found that benchmarking improved diabetes care provi- 
sion in settings characterised by poor overall standard of 
care at baseline [37]. However, clinical outcomes were 
rarely assessed, and no statistically significant improve- 
ments were demonstrated. A meta-analysis investigating 
benchmarking in clinical care found a small but signifi- 
cant positive effect on outcomes [38]. This analysis indi- 
cated that the effectiveness of benchmarking was 
improved when the benchmarked feedback was written 
rather than verbal or graphical, and when it was pro- 
vided more frequently. In addition, providing combined 
group- and individual-level feedback appeared to have a 
positive effect. 

There are few published reports of randomised studies 
investigating the effectiveness of benchmarking in 
T2DM in primary care. In a small study comparing two 
types of benchmarking interventions, 70 community 
physicians in the USA were randomly assigned to 
receive a multi-modal quality improvement intervention, 
including physician-specific benchmarking feedback or 
an identical intervention plus Achievable Benchmarks of 
Care (ABC) feedback [23]. In the physician-specific 
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benchmarking feedback only group, the mean perfor- 
mance of the individual physicians on patient outcomes 
was compared with the performance of their anon- 
ymised peers. In the ABC feedback group, the individual 
performance of the physicians was compared with stan- 
dards of excellence attained by top performers in the 
peer group. Odds ratios (ORs) for the ABC group vs 
comparison physicians who delivered appropriate care 
after the intervention were 1.57 (95% confidence interval 
[CI], 1.26,1.96) for influenza vaccination, 1.33 (95% CI, 
1.05,1.69) for foot examination, and 1.33 (95% CI, 
1.04,1.69) for long-term glucose control measurement. 
For serum cholesterol and TG, the effect of ABC feed- 
back was not statistically significant. Another rando- 
mised trial that examined simultaneous benchmarking 
of US physicians and their patients failed to demonstrate 
improved safety or quality of care [36]. 

The paucity of controlled clinical trial evidence in 
T2DM provided the rationale for the initiation of OPTI- 
MISE. The original protocol was developed in Belgium, 
based on a long-standing large scale feedback and bench- 
marking project in specialised diabetes centres, the 
Initiative for the Promotion of Quality and Epidemiology 
of Diabetes mellitus (IQED)' [39]. The OPTIMISE trial 
resulted from an interest to carry this initiative into the 
primary care setting. Because an intervention tested in 
different healthcare systems would provide greater infor- 
mation value, the OPTIMISE study was extended to five 
other countries in Europe. The main differences between 
the OPTIMISE design and the Belgian IQED initiative 
are that OPTIMISE was (i) performed in a primary care 
setting, (ii) utilised a RCT design with a comparative con- 
trol group that did not receive benchmarking feedback, 
(iii) used a central laboratory, and (iv) focused on a smal- 
ler set of quality indicators. To minimise measurement 
variability, a central laboratory was identified for measur- 
ing HbA lc and lipids in the OPTIMISE trial. In real-life 
clinical practice, routine laboratory measurements could 
be used for benchmarking, provided the laboratories per- 
forming the analyses have suitable accredited quality 
assurance programmes, e.g. for HbA lc measurement [36] . 
Experience from the IQED initiative indicated that using 
a large dataset may somehow dilute the impact of bench- 
marking. The use of a limited number of critical quality 
indicators in OPTIMISE enabled the study investigators 
to provide benchmarking in a simple format, with suc- 
cinct, easily interpretable messages accompanied by an 
immediately actionable item. In addition, the IQED 
initiative provided feedback every 18 months, while in 
OPTIMISE feedback was given every 4 months, as more 
frequent feedback was shown to augment the effect on 
outcomes [40]. 

The OPTIMISE trial has several strengths. Contrary to 
many studies in the field of quality improvement, this 



study uses a monofactorial intervention allowing its 
potential effects to be easily studied and contextualised. 
It randomises at the clinic level and not at the patient 
level to avoid confounding effects from physicians hav- 
ing both intervention and control patients. Results are 
analysed at the patient level as this is more clinically 
meaningful. Importantly, the RCT design avoids any 
positive effect bias that may be induced by increased 
attention caused by participation in a study, or by 
selecting more motivated physicians, or by overall 
changes in T2DM care over the study period. A possible 
confounder of the study may be the frequency of deter- 
mining the quality indicators, which was pre-specified in 
the protocol. Frequency of testing may have increased 
for both groups of patients in the study compared with 
their testing frequency prior to study initiation. This 
alone may have led to an improvement in target attain- 
ment in both groups. One possible limitation of the 
study is that LDL-cholesterol levels were estimated 
using Friedewald's formula, rather than being directly 
measured. However, this reflects real-life clinical prac- 
tice. A further limitation may be the requirement to 
establish a history of proteinuria from patient medical 
records, as proteinuria may not be routinely tested for 
and recorded. In addition to investigating the effects of 
benchmarking, the OPTIMISE study offers an opportu- 
nity to obtain comparable data on the quality of primary 
care received by patients with T2DM in six European 
countries. It is difficult to obtain comparable data across 
different European countries due to heterogeneity in 
data collection. Therefore, the OPTIMISE data have the 
potential to offer a particularly useful resource for Eur- 
opean healthcare providers, and help inform diabetes 
care policy decisions at the European level along with 
other initiatives such as the European Core Indicators in 
Diabetes project (EUCID), which aims to create a plat- 
form to collect and analyse data on health status and 
care delivery for diabetes in Europe [41]. 

In summary, although feedback with benchmarking is 
a promising tool for quality improvement in chronic 
care in general, and in diabetes care specifically, there is 
a striking lack of clinical evidence from controlled trials. 
The results from OPTIMISE may have the potential to 
contribute to filling this gap. 
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